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Abstract 

The goal of this paper is to derive a simple recursion that generates 
a sequence of fractions approximating yfk with increasing accuracy. 
The recursion is defined in terms of a series of first-order non-linear 
difference equations and then analyzed as a discrete dynamical sys- 
tem. Convergence behavior is then discussed in the language of initial 
trajectories and eigenvectors, effectively proving convergence without 
notions from standard analysis of infinitesimals. 



1 



1 Introduction and motivation 



Consider for a moment the simple recursion 

xt+i _ x t + 2y t 
y t +i x t + y t 

Choose initial values to set it marching on its way towards y2. If we choose 
initial values xq = yo = 1> the recursion ([1]) gives a sequence of fractions 
approximating y2 whose behavior is summarized in the table below. 



t 


xt/yt 







1/1 


1 


1 


3/2 


1.5 


2 


7/5 


1.4 


3 


17/12 


1.4167... 


4 


41/29 


1.41379... 


5 


99/70 


1.41429... 



For reassurance that the recursion generates a sequence that does infact 
converge to y/2, employ the following analysis: For some sequence at, 
if at — > L, then at+i — > L as well. So suppose that the recursion (pQ) has 
limit L. Multiply the top and bottom of the right hand side of (pQ) by 1 /yt 
and we get 

l 

x t +\ _ x t + 2y t ^£ 
yt+i xt + yt i" 

Using the previous fact about limits, we have 

L + 2 



L 



L + l 



This gives L = ±y/2. In this analysis, we have picked up the unsettling 
possibility of this recursion generating a sequence of fractions converging to 
—y/2. Discussion of which initial values generate such a sequence is withheld 
momentarily. 
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2 y/k Recursion 



It is convenient to consider recursion (pQ) as an action on a system of first 
order linear difference equations given by 

ix t+ i = x t + 2y t , ^ 

\yt+i = x t + yu 

which is a discrete dynamical system, or "DDS" . Clearly, if we replace 2 
by a positive integer k G Z + we obtain a recursion similar in structure to 
recursion ([1]) converging to ±v& for any initial values xo,yo- This recursion 
is 

x t +i _ x t + kyt 
y t+ i xt + yt ' 
This gives a corresponding DDS 



(3) 



xt+i = x t + ky t , ^ 
y t +i = xt + yt, 



which can be represented in matrix form as 

'x t+ x\ fl k\ (x t 



yt+ij U 1 J Kyt,' ' 



Any term in a sequence generated by ([5]) is generalized as 

(6) 



x t \ (1 k\ 1 ( xq 



Recursion ([3|) is recovered by taking ratios of terms with equal indices. 

3 \fk Recursion 



A natural question at this point would be "Does there exist a structurally 
simple recursion similar to ([3]) generating a sequence of fractions approxi- 
mating yfk ?" , to which the answer is "kind of . In the spirit of the previous 
analysis, start out with 

L = y/k 
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so that 

L n = k. 

Add L to both sides to get 

L n + L = L + k, 

then factor out an L: 

L{L n - 1 + 1) = L + k. 
Now divide both sides by L n ~ l + 1 to get 

^ L + fc 
~ L™" 1 + 1" 

Use the fact about sequences to obtain 

x t +i = x t + ky t 

yt+i 

which has the corresponding DDS 



xt + ky t 



+ y t 

Vt 




(7) 



(8) 



This latter system is not representable as a simple 2x2 matrix with real 
entries for arbitrary n. This is because there are now three different terms 
in this system: xt, yt, and x™ _1 /y" _2 which is non-linear. 



4 Convergence behavior of the DDS given by II 



We are now in a position to analyze (jHJ) using matrix methods. Suppose there 
did exist an n x n matrix, call it H n , such that (jHI could be represented as 







( ^0,l\ 


Xt,2 




XQ,2 


= IT 


\Xt,n) 




\XQ,nJ 



(9) 
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where x a ^ denotes the 6th entry of the vector corresponding to a. Or equiv- 
alently, Rt = IL^Ro. But what would the matrix U n look like? Consider 
the properties of (|8j) which II n must capture in order to faithfully represent 
convergence behavior of ([7]). First, recognize that the action on ([8]) needed 
to arrive at ([7]) is taking ratios of terms with equal indices, which is equiv- 
alent to taking the ratio of successive entries of a vector Rt. The entries in 
an arbitrary vector Rt must tend toward those of the dominant eigenvector 
of n n , Xd- So the ratio of any two successive entries in A^ should be equal 
to \fk. That is, \d,i/^d,i+i = y/k where 1 < i < n — 1 and A^j is the ith 
entry of the dominant eigenvector of II n . Such an eigenvector looks like 



V 



/ 



Recovery of the dominant eigenvalue from the dominant eigenvector is had 
by solving for A^ in (n n — Arf/ n )Arf = where is the column vector consisting 
entirely of zeros. This calculation yields A^ = 1 + {Vk. 

Now suppose II n is diagonalizable. Then U n admits a basis for M. n 

consisting entirely of eigenvectors of Tl n . So any initial vector Rq in R n can 

-> n -» 

be written as a linear combination of eigenvectors Rq = CjAj. 

i=i 



Applying n n to our initial vector Rq, 

n 

R\ = n n ^o = n n cjAj 

i=l 

= ciII n Ai H h c n U n \ n 

= C1A1A1 + • • • + c n \ n \ n . 

Applying Ii n again, 

R 2 = n 2 n Ro = U n {U n R ) = n n (ciAiAi + • • • + c n A n A n ) 

= ciAin n Ai + • • • + c n A n II n A n 
= ciA^Ai + • • • + c n A^A n . 
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We can see the pattern now, 



Rt — — ciAi*Ai + • • • + c n A n *A n — CjA*Aj 
Verify that indeed, 



(10) 



i=i 



Rt — n^-Ro 



( ClAl*Al j l + • • • + C n \n\ n ^\ 

c\\\ \\ 2 H h c n A n *A n 2 



where X a ^ denotes the 6th entry of the eigenvector corresponding to A a . 

Note that with increasing t, only one of the CjA|Aj i terms becomes the 
dominant term. The dominant term is the one involving the dominant 
eigenvalue, A^ and the entries of its corresponding eigenvector, A^j. Since 
contributions of the other terms become negligible in the limiting quotient, 
we can make the following statement: 



R t = U^Ro « cX d X d = c(l + \fkf 



n-2 



for t > 1. 



It follows that 



R 



t.i 



(11) 



V i / 

c(l + s/kfX^i H h CnXjX^j 

R t ,i+i c(l + y/kfXd 

^ C(l + VkfXdj _ X d ,j _ 

c(l + \/kyXd,i+i \d,i+i 
for (1 < i < n - 1) and t > 1. 



When n = 2 (finding square roots), let c = and notice that (jlip is 
approximately — y/k. To satisfy our curiosity from Section 1, if we are to 
have a sequence generated by ([2]) converging to — \fk then the appropriate 
initial values are ones which are components of some multiple of the second 
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eigenvector, not a linear combination of the dominant eigenvector and the 
other. This way, ratios of successive entries of an evolving vector equal the 
slope of the second eigenvector, —yk. If we restrict our choices of initial 
values to Q 2 , then we do not run into this problem of multiple limits. 



5 Derivation of the matrix II 



In the preceding section we showed that if (|8|) can be represented as ([9]), 
then limt_>._|_ 00 xt,i/xt,i+i = tyk, as desired. It only remains to find the exact 
form for U n . In the analysis above, we showed that long-term time evolution 
of an initial vector depends heavily on "hitting" with its corresponding 
eigenvalue A^, which in turn depends on left-multiplying Rq by n„. Carrying 
this calculation out gives 







((ft) 






(*E) 















^ n-V 

n-1 

) 

n— 3 
n— 4 



+ 



k \ 

n-l 



n-2 



n— 3 



Evidently, Tl n is such that when an initial vector Rq is left multiplied by 
it, returned is the dominant eigenvector plus another vector, v, along with 
other negligible terms in the limit that i > 1. This means that U n must 
be the sum of two matrices acting on the linear combination of eigenvectors 
that comprises Rq. 

TL n R = (I n + 7r„) (\ d \d H h A n A n ) . 

where I n is the n-dimensional identity matrix. But what is 7r n ? Consider 
what action is taken by n n to return v from A^. Apparently, TT n is an n x n 
matrix such that when it left multiplies a column vector, it has the effect of 
permuting entries by one place in a cyclic manner while scaling by a factor 
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of \fk. By inspection, we see that 



TT,, 



/0 

1 

1 

1 



\o o o 

The exact form of Il n is given by 

n n = I n + 7T n 



k\ 






1 0/ 



A 





•• 


• 


o\ 




/o 





•• 


• 


k\ 





1 


•• 


• 







1 





•• 


• 











1 •• 


• 










1 


•• 


• 











•• 


• 1 





+ 








1 •• 


• 










•• 


• 


V 









•• 


• 1 


0/ 


/I 





•• 


• 


k\ 












1 


1 


•• 


• 


















1 


1 •• 


• 





















1 •• 


• 















\o 





•• 


• 1 















6 Algebraic properties of the matrix II 



We must still confirm diagonalizability of U n since most of our case depends 
upon this property of Ii n . The characteristic polynomial, P(A), of H n can 
be found by computing det(n n — XI n ) by expanding in minors along the 
top row, giving P(A) = (1 — A) n + (— l) n+1 k. The eigenvalues are had 
by solving P(A) = giving A j = 1 — \fk e tJn / n where 1 < j < n and J = 
2j when n is even and 2j+l when n is odd. The largest of these eigenvalues 
is Ad = An = Xr^i = 1 + \fk as desired. These n distinct eigenvalues give 
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n distinct eigenvectors given by 



/ A,-l \ 
k 



\ 1 / 

the largest of which corresponds to the eigenvalue A^; this vector is 



y/k 
Vk 



n-2 
n— 3 
n— 4 



V 



1 



/ 



Since II gives n distinct eigenvalues and eigenvectors, Ii n is diagonalizable. 
We are now in the position to make the generalization of as follows 



Rt gives rise to lim _ 1,1 



(12) 



for 1 < i < n — 1 and t € Z. Equivalent ly, 



(I 





•• 


• 


k\ 


t 






/VA 


1 


1 


•• 


• 







X2 




x 2 





1 


1 •• 


• 







X3 




x' 3 








1 ■ ■ 


• 







£4 







so that lim 



t^+oo x' 



\0 
s/k. 



1 1/ \Xn) 



This result effectively fulfils the goal of the paper which was to derive a 
simple recursion that generates a sequence of fractions approximating \fk 
with increasing accuracy. 
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7 Computation 



The result of the previous section satisfies the technical goal of this paper, 
but it is left to the reader to judge the practicality of this result. Accuracy 
of an approximation depends on taking powers of an n x n matrix. This 
tedious task can be tiresome for even relatively small powers of n and t. So 
where do we look to find aid in this computation? One could certainly start 
with the Cayley-Hamilton Theorem, which states that every n x n matrix 
over a commutative ring satisfies its own characteristic equation, P(\) = 
det(A7 n — A). Applying this theorem to H n gives 

p(n n ) = (/„ - u n ) n + (-i) n+l k i n = o, 

where is the n x n matrix consisting entirely of zeros. The binomial 
theorem then gives 

V n!(-l)* j - ( _i)n kI 
Solving then for 11^ gives 

U n = E n 'iU n _^ U n + [("I)"" 1 + k) I n . (13) 
i=l ' K >' 

This is an explicit equation expressing II" in terms of lower powers of II n 
and I n . It is useful because if one is able to calculate powers of II n up to 
and including II™ -1 , then one is able to generate arbitrarily large powers of 
II n iteratively which then can be used to generate arbitrarily close approxi- 
mations to \fk. 



Let's take a look at the n = 2 case. Equation (|13p gives 

n| = 2n n + {k — 1)/ 2 , (u) 

which is an explicit expression of II2 in first powers of H2 and Because 
no higher powers of H2 need to be calculated to arrive at (|14p . arbitrary 
integer powers of H2 are gotten with ease from iterative multiplication and 
substitution of powers of H2. This gives rise to a Fibonaccidike sequence in 
the exponents of H2: 
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nl = 2U n + {k- \)i 2 
ni = n 2 n2 

= 2U 2 2 + (k - i)n 2 

= (k + 3)n 2 + 2(/c - l)/ 2 

ni = n^n 2 

= (/e 2 + 10A: + 5)n 2 + 4(k 2 - l)/ 2 

n^nf-nf- 2 . 

The reader is encouraged to try this for the n = 3,4,5,... cases to see 
that once harrowing computations are done to make II™ -1 known, precise 
approximate computation soon follows. 
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